Reconstructing DIOGENE: ITC-irst at TREC 2006

نویسندگان

  • Matteo Negri
  • Milen Kouylekov
  • Bernardo Magnini
  • Bonaventura Coppola
چکیده

Our participation in the TREC 2006 QA task is the first step on the way of developing a new and improved DIOGENE system. The leading principles of this re-engineering activity are: i) to create a modular architecture, based on a pipeline of modules which share common I/O formats, open to the insertion/substitution of new components; ii) to allow for the capability of configuring the settings of the different modules with external configuration files; iii) to provide the capability of performing fine-grained evaluation cycles over the individual processing modules which compose a QA system. Another long-term objective of our work on QA, is to make the core components of the system freely available to the QA community for research purposes. This paper overviews the work done up to date to achieve these objectives, focusing on the description of a prototype module designed to handle the anaphoric questions often contained into TREC QA series. Preliminar evaluation results of the new module are presented, together with those achieved by DIOGENE at TREC 2006.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combining Linguistic Processing and Web Mining for Question Answering: ITC-irst at TREC 2004

This paper describes the work we have been done in the last year on the DIOGENE Question Answering system developed at ITC-Irst. We present two preliminary experiments showing the possibility of integrating into DIOGENE a textual entailment engine based on entailment rules. We addressed the problem proposing both a methodology for acquiring rules from the Web and a matching algorithm for compar...

متن کامل

ITC-irst at TREC 2003: the DIOGENE QA System

This paper describes a new version of the DIOGENE Question Answering (QA) system developed at ITC-Irst. The recent updates here presented are targeted to the participation to TREC-2003 and meet the specific requirements of this year’s QA main task. In particular, extending the backbone already developed for our participation to the last two editions of the QA track, special attention was paid t...

متن کامل

The DIOGENE Question Answering System at CLEF-2004

This paper presents the ITC-irst Multilingual Question Answering system DIOGENE. The system was used successfully on the CLEF-2003, TREC-2003, TREC-2002 and TREC-2001 QA tracks. DIOGENE relies on a classical three-layer architecture: question processing, document retrieval, answer extraction and validation. DIOGENE uses MultiWordNet [Pianta et.al. 2002] (http://multiwordnet.itc.it) which facili...

متن کامل

Bridging Languages for Question Answering: DIOGENE at CLEF 2003

This paper presents the extension of the ITC-irst DIOGENE Question Answering system towards multilinguality. DIOGENE relies on a well tested three-components architecture built in the framework of our participation in the QA track at the Text Retrieval Conference (TREC 2002). The novelty factors are represented by the enhancement of the system with language-specific tools targeted to the Italia...

متن کامل

Knowledge from Repeated Co - occurrences : DIOGENE at TREC - 2002

This paper presents a new version of the DIOGENE Question Answering (QA) system developedd at ITC-Irst.. Withh respect too our first participationn too the TREC QA m ainn task (TREC-2001),, the system presents bothh improvements andd extensions.. Onn one hand,, significant improvements r ely onn the substitutionn of basic components (e.g. the searchh engine andd the tool inn charge of the named...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006